Compressive statistical learning with random feature moments
نویسندگان
چکیده
We describe a general framework --compressive statistical learning-- for resource-efficient large-scale learning: the training collection is compressed in one pass into low-dimensional sketch (a vector of random empirical generalized moments) that captures information relevant to considered learning task. A near-minimizer risk computed from through solution nonlinear least squares problem. investigate sufficient sizes control generalization error this procedure. The illustrated on compressive PCA, clustering, and Gaussian mixture Modeling with fixed known variance. latter two are further developed companion paper.
منابع مشابه
Compressive Statistical Learning with Random Feature Moments
We describe a general framework –compressive statistical learning– for resource-efficient largescale learning: the training collection is compressed in one pass into a low-dimensional sketch (a vector of random empirical generalized moments) that captures the information relevant to the considered learning task. A near-minimizer of the risk is computed from the sketch through the solution of a ...
متن کاملCompressive Feature Learning
This paper addresses the problem of unsupervised feature learning for text data. Our method is grounded in the principle of minimum description length and uses a dictionary-based compression scheme to extract a succinct feature set. Specifically, our method finds a set of word k-grams that minimizes the cost of reconstructing the text losslessly. We formulate document compression as a binary op...
متن کاملCompressive Reinforcement Learning with Oblique Random Projections
Compressive sensing has been rapidly growing as a non-adaptive dimensionality reduction framework, wherein high-dimensional data is projected onto a randomly generated subspace. In this paper we explore a paradigm called compressive reinforcement learning, where approximately optimal policies are computed in a lowdimensional subspace generated from a high-dimensional feature space through rando...
متن کاملSmall Statistical Models by Random Feature Mixing
The application of statistical NLP systems to resource constrained devices is limited by the need to maintain parameters for a large number of features and an alphabet mapping features to parameters. We introduce random feature mixing to eliminate alphabet storage and reduce the number of parameters without severely impacting model performance.
متن کاملStatistical moments of the random linear transport equation
This paper deals with a numerical scheme to approximate the mth moment of the solution of the one-dimensional random linear transport equation. The initial condition is assumed to be a random function and the transport velocity is a random variable. The scheme is based on local Riemann problem solutions and Godunov’s method. We show that the scheme is stable and consistent with an advective-dif...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Mathematical statistics and learning
سال: 2021
ISSN: ['2520-2316', '2520-2324']
DOI: https://doi.org/10.4171/msl/20